Abstract. We develop a general solution for the Fokker-Planck (Kolmogorov)
equation representing the diffusion limit of the Wright-Fisher model of random
genetic drift for an arbitrary number of alleles at a single locus. From this
solution, we can readily deduce information about the evolution of a Wright-Fisher
population.

1. Introduction 

The random genetic drift model developed implicitly by Fisher in [3] and explicitly
by Wright in [11], henceforth called the Wright-Fisher model, is one of the
most popular stochastic models in population genetics ([HH]). In its simplest form,
it is concerned with the evolution of the probabilities between non-overlapping
generations in a population of fixed size of two alleles at a single diploid locus that
are obtained from random sampling in the parental generation, without additional
biological mechanisms like mutation, selection, or a spatial population structure.
Generalizations to multiple alleles, several loci, and the inclusion of mutation and
selection have since come to constitute an important part of mathematical population
genetics. It is our aim to develop a general mathematical perspective on the
Wright-Fisher model and its generalizations. In the present paper, we treat the case
of multiple alleles at a single site. In a companion paper [10], we discuss the simplest
case of 2 alleles in more detail. Generalizations will be addressed in subsequent papers.

Let us first describe the basic mathematical contributions of Wright and Kimura.
In 1945, Wright approximated the discrete process by a diffusion process that is
continuous in space and time (continuous process, for short) and that can be described
by a Fokker-Planck equation. In 1955, by solving this Fokker-Planck equation
derived from the Wright-Fisher model, Kimura obtained an exact solution for the
Wright-Fisher model in the case of 2 alleles (see [1]). Kimura ([S]) also developed
an approximation for the solution of the Wright-Fisher model in the multi-allele
case, and in 1956, he obtained ([6]) an exact solution of this model for 3 alleles and
concluded that this can be generalized to arbitrarily many alleles. This yields more
information about the Wright-Fisher model as well as the corresponding continuous
process. Kimura's solution, however, is not entirely satisfactory. For one thing,
it depends on very clever algebraic manipulations, so that the general mathematical
structure is not very transparent, and this makes generalizations very difficult.
Also, Kimura's approach is local in the sense that it does not naturally incorporate
the transitions resulting from the (irreversible) loss of one or more alleles in the
population. Therefore, for instance, the integral of his probability density function
over its domain of definition is not equal to 1.

In this paper, we derive the formalism for a general solution that naturally
includes the transitions resulting from the disappearance of alleles. The key
ingredients are the evolution equations for the moments of the probability density.
We show that there exists a unique global solution of the Fokker-Planck equation.
We then utilize this solution to derive properties of the underlying process, like
the expected transition times.



2. The global solution of the Wright-Fisher model 

In this section, we establish some notation and then prove several propositions
as well as the main theorem of this paper.

2.1. Notations. $\Delta_n := \{(x^1, x^2, \ldots, x^{n+1}) : \sum_{i=1}^{n+1} x^i = 1\}$ is the standard $n$-simplex
in $\mathbb{R}^{n+1}$, representing the probabilities or relative frequencies of alleles $A_1, \ldots, A_{n+1}$
in our population. Often, however, it is advantageous to work in $\mathbb{R}^n$ instead of $\mathbb{R}^{n+1}$,
and with $e_0 := (0, \ldots, 0) \in \mathbb{R}^n$, $e_k := (0, \ldots, 1, \ldots, 0) \in \mathbb{R}^n$ (with the $1$ in the $k$-th position), we therefore define

\[ V_n := \mathrm{int\,co}\{e_0, \ldots, e_n\} := \Big\{ \sum_{k=0}^{n} x^k e_k : (x, x^0) = \Big(x^1, \ldots, x^n, 1 - \sum_{k=1}^{n} x^k\Big) \in \mathrm{int}\,\Delta_n \Big\}. \]

Moreover, we shall need the subsimplices corresponding to subsets of alleles, using
the following notations:

\[ I_k := \{ \{i_0, \ldots, i_k\} : 0 \le i_0 < \cdots < i_k \le n \}, \quad k \in \{1, \ldots, n\}, \]
\[ V_0 := \{e_0, \ldots, e_n\}, \]

the domain representing a population of one allele,

\[ V_k^{(i_0,\ldots,i_k)} := \mathrm{int\,co}\{e_{i_0}, \ldots, e_{i_k}\}, \quad k \in \{1, \ldots, n\}, \]

the domain representing a population of alleles $\{A_{i_0}, \ldots, A_{i_k}\}$,

\[ V_k := \bigcup_{(i_0,\ldots,i_k) \in I_k} V_k^{(i_0,\ldots,i_k)}, \quad k \in \{1, \ldots, n\}, \]

the domain representing a population of $(k+1)$ alleles,

\[ \bar V_k := \bigcup_{i=0}^{k} V_i, \quad k \in \{1, \ldots, n\}, \]

the domain representing a population of at most $(k+1)$ alleles.
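We shall also need some function spaces:

\[ H_k^{(i_0,\ldots,i_k)} := C^\infty\big(\bar V_k^{(i_0,\ldots,i_k)}\big), \qquad H_k := C^\infty(\bar V_k), \quad k \in \{1, \ldots, n\}, \]
\[ H := \{ f : \bar V_n \to [0, \infty] \text{ measurable such that } [f, g]_n < \infty,\ \forall g \in H_n \}, \]

where

\[ [f, g]_n := \int_{\bar V_n} f(x) g(x)\, d\mu(x) = \sum_{k=0}^{n} \sum_{(i_0,\ldots,i_k) \in I_k} \int_{V_k^{(i_0,\ldots,i_k)}} f(x) g(x)\, d\mu_k^{(i_0,\ldots,i_k)}(x), \]

with $\mu_k^{(i_0,\ldots,i_k)}$ a probability measure on $V_k^{(i_0,\ldots,i_k)}$.

We can now define the differential operators for our Fokker-Planck equation:

\[ L_k^{(i_0,\ldots,i_k)} : H_k^{(i_0,\ldots,i_k)} \to H_k^{(i_0,\ldots,i_k)}, \quad L_k^{(i_0,\ldots,i_k)} X := \frac{1}{2} \sum_{i,j \in \{i_1,\ldots,i_k\}} \frac{\partial^2 \big(a_{ij}(x) X(x)\big)}{\partial x^i \partial x^j}, \]
\[ \big(L_k^{(i_0,\ldots,i_k)}\big)^* X := \frac{1}{2} \sum_{i,j \in \{i_1,\ldots,i_k\}} a_{ij}(x) \frac{\partial^2 X(x)}{\partial x^i \partial x^j}, \]
\[ L_k : H_k \to H_k, \quad (L_k)|_{V_k^{(i_0,\ldots,i_k)}} = L_k^{(i_0,\ldots,i_k)}, \qquad (L_k^*)|_{V_k^{(i_0,\ldots,i_k)}} = \big(L_k^{(i_0,\ldots,i_k)}\big)^*, \]

where the coefficients are defined by

\[ a_{ij}(x) := x^i(\delta_{ij} - x^j), \quad i, j \in \{1, \ldots, n\}. \]

Finally, we shall need

\[ w_k^{(i_0,\ldots,i_k)}(x) := \prod_{i \in \{i_0,\ldots,i_k\}} x^i, \quad k \in \{1, \ldots, n\}. \]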



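Proposition 2.1. For each $1 \le k \le n$, $m \ge 0$, $|\alpha| = \alpha^1 + \cdots + \alpha^k = m$, the
polynomial of degree $m$ in $k$ variables $x = (x^{i_1}, \ldots, x^{i_k})$ in $V_k^{(i_0,\ldots,i_k)}$

\[ X^{(k)}_{m,\alpha}(x) = x^\alpha + \sum_{|\beta| < m} a^{(k)}_{m,\beta}\, x^\beta, \tag{2.1} \]

where the $a^{(k)}_{m,\beta}$ are inductively defined by

\[ a^{(k)}_{m,\beta} = -\frac{\sum_{i=1}^{k} (\beta^i + 2)(\beta^i + 1)\, a^{(k)}_{m,\beta + e_i}}{(m - |\beta|)(m + |\beta| + 2k + 1)}, \quad \forall |\beta| < m, \]

is the eigenvector of $L_k^{(i_0,\ldots,i_k)}$ corresponding to the eigenvalue $\lambda_m^{(k)} = \frac{(m+k)(m+k+1)}{2}$.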
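Proof. We have

\begin{align*}
L_k^{(i_0,\ldots,i_k)} X^{(k)}_{m,\alpha}(x)
&= \frac{1}{2} \sum_{i \in \{i_1,\ldots,i_k\}} \frac{\partial^2}{(\partial x^i)^2} \Big( x^i (1 - x^i) \Big( x^\alpha + \sum_{|\beta| < m} a^{(k)}_{m,\beta} x^\beta \Big) \Big) \\
&\quad - \frac{1}{2} \sum_{i \ne j \in \{i_1,\ldots,i_k\}} \frac{\partial^2}{\partial x^i \partial x^j} \Big( x^i x^j \Big( x^\alpha + \sum_{|\beta| < m} a^{(k)}_{m,\beta} x^\beta \Big) \Big) \\
&= -\frac{1}{2} \Big( \sum_{i} (\alpha^i + 2)(\alpha^i + 1) + \sum_{i \ne j} (\alpha^i + 1)(\alpha^j + 1) \Big) x^\alpha + \text{terms of lower degree} \\
&= -\frac{1}{2} \Big( \sum_i \alpha^i + k \Big) \Big( \sum_i \alpha^i + k + 1 \Big) x^\alpha + \text{terms of lower degree} \\
&= -\frac{(m+k)(m+k+1)}{2}\, x^\alpha + \text{terms of lower degree}.
\end{align*}

By equating coefficients we obtain

\[ \lambda_m^{(k)} = \frac{(m+k)(m+k+1)}{2} \]

and

\[ a^{(k)}_{m,\beta} = -\frac{\sum_{i=1}^{k} (\beta^i + 2)(\beta^i + 1)\, a^{(k)}_{m,\beta + e_i}}{(m - |\beta|)(m + |\beta| + 2k + 1)}, \quad \forall |\beta| < m. \]

This completes the proof. □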



Remark 2.2. When $k = 1$, $X^{(1)}_{m,m}(x^1)$ is the $m$-th Gegenbauer polynomial (up to a
constant). Thus, the polynomials $X^{(k)}_{m,\alpha}(x)$ can be understood as a generalization
of the Gegenbauer polynomials to higher dimensions.
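The recursion of Proposition 2.1 can be checked mechanically. The following is a minimal sketch, not part of the original argument: it stores polynomials as dictionaries from exponent tuples to exact rational coefficients, builds $X^{(k)}_{m,\alpha}$ from the recursion, and verifies $L X = -\lambda_m^{(k)} X$ by differentiating directly. The helper names (`add`, `bump`, `diff`, `apply_L`, `eigenpolynomial`, `is_eigenpolynomial`) are illustrative choices, and the variables are indexed $0, \ldots, k-1$ in the code.

```python
from fractions import Fraction
from itertools import product


def add(poly, expo, c):
    """Add c * x^expo to a polynomial stored as {exponent tuple: coefficient}."""
    value = poly.get(expo, Fraction(0)) + c
    if value:
        poly[expo] = value
    else:
        poly.pop(expo, None)


def bump(expo, i, by=1):
    """Copy of the exponent tuple with entry i changed by `by`."""
    return expo[:i] + (expo[i] + by,) + expo[i + 1:]


def diff(poly, i):
    """Partial derivative with respect to the i-th variable."""
    out = {}
    for expo, c in poly.items():
        if expo[i] > 0:
            add(out, bump(expo, i, -1), c * expo[i])
    return out


def apply_L(poly, k):
    """L X = 1/2 sum_{i,j} d^2(a_ij X)/dx^i dx^j with a_ij = x^i (delta_ij - x^j)."""
    out = {}
    for i in range(k):
        for j in range(k):
            a_times_x = {}
            for expo, c in poly.items():
                if i == j:
                    add(a_times_x, bump(expo, i), c)
                add(a_times_x, bump(bump(expo, i), j), -c)
            for expo, c in diff(diff(a_times_x, i), j).items():
                add(out, expo, c / 2)
    return out


def eigenpolynomial(alpha):
    """Coefficients of X_{m,alpha}: x^alpha plus the recursion for |beta| < m."""
    k, m = len(alpha), sum(alpha)
    coeff = {tuple(alpha): Fraction(1)}
    for deg in range(m - 1, -1, -1):  # a_beta only needs the a_{beta + e_i}
        for beta in product(range(m + 1), repeat=k):
            if sum(beta) != deg:
                continue
            s = sum((beta[i] + 2) * (beta[i] + 1) * coeff.get(bump(beta, i), 0)
                    for i in range(k))
            if s:
                coeff[beta] = -Fraction(s) / ((m - deg) * (m + deg + 2 * k + 1))
    return coeff


def is_eigenpolynomial(alpha):
    """Check L X = -lambda X with lambda = (m + k)(m + k + 1) / 2."""
    k, m = len(alpha), sum(alpha)
    lam = Fraction((m + k) * (m + k + 1), 2)
    X = eigenpolynomial(alpha)
    return apply_L(X, k) == {expo: -lam * c for expo, c in X.items()}


print(eigenpolynomial((1,)))  # coefficients of x - 1/2
print(all(is_eigenpolynomial(a) for a in [(1,), (3,), (1, 1), (2, 1), (1, 0, 2)]))
```

For $k = 1$, $m = 1$ the recursion gives $X = x - \tfrac12$, and indeed $\tfrac12 \big(x(1-x)(x - \tfrac12)\big)'' = -3(x - \tfrac12)$ with $\lambda_1^{(1)} = 3$.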
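Proposition 2.3. If $X \in H_k^{(i_0,\ldots,i_k)}$ is an eigenvector of $L_k^{(i_0,\ldots,i_k)}$ corresponding
to $\lambda$, then $w_k^{(i_0,\ldots,i_k)} X$ is an eigenvector of $\big(L_k^{(i_0,\ldots,i_k)}\big)^*$ corresponding to $\lambda$.

Proof. For brevity, write $L := L_k^{(i_0,\ldots,i_k)}$ and $w := w_k^{(i_0,\ldots,i_k)}$. If $X \in H_k^{(i_0,\ldots,i_k)}$ is an eigenvector of $L$ corresponding to $\lambda$, it follows that

\begin{align*}
-\lambda X &= \frac{1}{2} \sum_{i,j \in \{i_1,\ldots,i_k\}} \frac{\partial^2 (a_{ij} X)}{\partial x^i \partial x^j} \\
&= \frac{1}{2} \sum_{i,j \in \{i_1,\ldots,i_k\}} a_{ij} \frac{\partial^2 X}{\partial x^i \partial x^j} + \sum_{i \in \{i_1,\ldots,i_k\}} \big(1 - (k+1) x^i\big) \frac{\partial X}{\partial x^i} - \frac{k(k+1)}{2} X.
\end{align*}

Since

\[ \sum_{i \in \{i_1,\ldots,i_k\}} a_{ij} \frac{\partial w}{\partial x^i} = w \big(1 - (k+1) x^j\big), \qquad \frac{1}{2} \sum_{i,j \in \{i_1,\ldots,i_k\}} a_{ij} \frac{\partial^2 w}{\partial x^i \partial x^j} = -\frac{k(k+1)}{2}\, w, \]

we obtain

\begin{align*}
L^*(w X) &= \frac{1}{2} \sum_{i,j \in \{i_1,\ldots,i_k\}} a_{ij} \frac{\partial^2 (w X)}{\partial x^i \partial x^j} \\
&= w \Big( \frac{1}{2} \sum_{i,j} a_{ij} \frac{\partial^2 X}{\partial x^i \partial x^j} + \sum_{j} \big(1 - (k+1) x^j\big) \frac{\partial X}{\partial x^j} - \frac{k(k+1)}{2} X \Big) \\
&= -\lambda\, w X.
\end{align*}

This completes the proof. □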
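Proposition 2.4. Let $\nu$ be the exterior unit normal vector of the domain $V_k^{(i_0,\ldots,i_k)}$.
Then we have

\[ \sum_{j \in \{i_1,\ldots,i_k\}} a_{ij}\, \nu^j = 0 \ \text{ on } \partial V_k^{(i_0,\ldots,i_k)}, \quad \forall i \in \{i_1, \ldots, i_k\}. \tag{2.2} \]

Proof. In fact, on the surface $\{x^s = 0\}$, for some $s \in \{i_1, \ldots, i_k\}$, we have $\nu = -e_s$,
and hence $\sum_{j \in \{i_1,\ldots,i_k\}} a_{ij} \nu^j = -a_{is} = -x^s(\delta_{si} - x^i) = 0$. On the surface $\{x^{i_0} = 0\}$ we
have $\nu = \frac{1}{\sqrt{k}}(e_{i_1} + \cdots + e_{i_k})$, hence $\sum_{j \in \{i_1,\ldots,i_k\}} a_{ij} \nu^j = \frac{1}{\sqrt{k}}\, x^i \Big(1 - \sum_{j \in \{i_1,\ldots,i_k\}} x^j\Big) = \frac{1}{\sqrt{k}}\, x^i x^{i_0} = 0$.
This completes the proof. □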

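Proposition 2.5. $L_k^{(i_0,\ldots,i_k)}$ and $\big(L_k^{(i_0,\ldots,i_k)}\big)^*$ are weighted adjoints in $H_k^{(i_0,\ldots,i_k)}$,
i.e.,

\[ \big(L_k^{(i_0,\ldots,i_k)} X,\ w_k^{(i_0,\ldots,i_k)} Y\big) = \big(X,\ (L_k^{(i_0,\ldots,i_k)})^* (w_k^{(i_0,\ldots,i_k)} Y)\big), \quad \forall X, Y \in H_k^{(i_0,\ldots,i_k)}, \]

where $(f, g) := \int_{V_k^{(i_0,\ldots,i_k)}} f(x) g(x)\, d\mu_k^{(i_0,\ldots,i_k)}(x)$.

Proof. For brevity, write $L := L_k^{(i_0,\ldots,i_k)}$, $w := w_k^{(i_0,\ldots,i_k)}$, $V := V_k^{(i_0,\ldots,i_k)}$, and let all sums run over $\{i_1, \ldots, i_k\}$. We put $F_i^{(k)}(x) := \sum_{j} \frac{\partial (a_{ij}(x) X(x))}{\partial x^j}$. Because $w Y \in C_0^\infty(\bar V)$, by
the second Green formula and Proposition 2.4 we have

\begin{align*}
(L X, w Y) &= \frac{1}{2} \sum_{i,j} \int_{V} \frac{\partial^2 (a_{ij}(x) X(x))}{\partial x^i \partial x^j}\, w(x) Y(x)\, d\mu(x) \\
&= \frac{1}{2} \sum_{i} \int_{V} \frac{\partial F_i^{(k)}(x)}{\partial x^i}\, w(x) Y(x)\, d\mu(x) \\
&= \frac{1}{2} \sum_{i} \int_{\partial V} F_i^{(k)}(x)\, \nu^i\, w(x) Y(x)\, do(x) - \frac{1}{2} \sum_{i} \int_{V} F_i^{(k)}(x) \frac{\partial (w(x) Y(x))}{\partial x^i}\, d\mu(x) \\
&= -\frac{1}{2} \sum_{i,j} \int_{V} \frac{\partial (a_{ij}(x) X(x))}{\partial x^j} \frac{\partial (w(x) Y(x))}{\partial x^i}\, d\mu(x) \\
&= -\frac{1}{2} \sum_{i,j} \int_{\partial V} a_{ij}(x)\, \nu^j X(x) \frac{\partial (w(x) Y(x))}{\partial x^i}\, do(x) + \frac{1}{2} \sum_{i,j} \int_{V} a_{ij}(x) X(x) \frac{\partial^2 (w(x) Y(x))}{\partial x^i \partial x^j}\, d\mu(x) \\
&= (X, L^*(w Y)).
\end{align*}

Here the first boundary integral vanishes because $w = 0$ on $\partial V$, and the second one vanishes by (2.2).

□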
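Proposition 2.6. In $V_k^{(i_0,\ldots,i_k)}$, $\big\{X^{(k)}_{m,\alpha}\big\}_{m \ge 0, |\alpha| = m}$ is a basis of $H_k^{(i_0,\ldots,i_k)}$ which
is orthogonal with respect to the weights $w_k^{(i_0,\ldots,i_k)}$, i.e.,

\[ \big(X^{(k)}_{m,\alpha},\ w_k^{(i_0,\ldots,i_k)} X^{(k)}_{j,\beta}\big) = 0, \quad \forall m \ne j,\ |\alpha| = m,\ |\beta| = j. \]

Proof. $\big\{X^{(k)}_{m,\alpha}\big\}_{m \ge 0, |\alpha| = m}$ is a basis of $H_k^{(i_0,\ldots,i_k)}$ because $\{x^\alpha\}_{m \ge 0, |\alpha| = m}$ is a basis of this
space. To prove the orthogonality we apply Propositions 2.1, 2.3 and 2.5 as follows:

\begin{align*}
-\lambda_m^{(k)} \big(X^{(k)}_{m,\alpha}, w_k^{(i_0,\ldots,i_k)} X^{(k)}_{j,\beta}\big)
&= \big(L_k^{(i_0,\ldots,i_k)} X^{(k)}_{m,\alpha}, w_k^{(i_0,\ldots,i_k)} X^{(k)}_{j,\beta}\big) \\
&= \big(X^{(k)}_{m,\alpha}, (L_k^{(i_0,\ldots,i_k)})^* (w_k^{(i_0,\ldots,i_k)} X^{(k)}_{j,\beta})\big) \\
&= -\lambda_j^{(k)} \big(X^{(k)}_{m,\alpha}, w_k^{(i_0,\ldots,i_k)} X^{(k)}_{j,\beta}\big).
\end{align*}

Because $\lambda_m^{(k)} \ne \lambda_j^{(k)}$ for $m \ne j$, this finishes the proof. □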
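The weighted orthogonality can be observed directly in the two-allele case $k = 1$, where the weight is $w_1(x) = x(1-x)$ on $[0,1]$. The sketch below is an illustration under that restriction; the function names `eigenpoly_1d` and `weighted_inner` are hypothetical helpers, and the integral is taken with respect to Lebesgue measure on $[0,1]$.

```python
from fractions import Fraction


def eigenpoly_1d(m):
    """Coefficient list [a_0, ..., a_m] of X_m for k = 1 (recursion of Proposition 2.1)."""
    a = [Fraction(0)] * (m + 1)
    a[m] = Fraction(1)
    for b in range(m - 1, -1, -1):
        a[b] = -Fraction((b + 2) * (b + 1)) * a[b + 1] / ((m - b) * (m + b + 3))
    return a


def weighted_inner(p, q):
    """Exact integral over [0, 1] of p(x) q(x) x (1 - x)."""
    total = Fraction(0)
    for i, pi in enumerate(p):
        for j, qj in enumerate(q):
            # integral of x^(i+j+1) - x^(i+j+2) over [0, 1]
            total += pi * qj * (Fraction(1, i + j + 2) - Fraction(1, i + j + 3))
    return total


polys = [eigenpoly_1d(m) for m in range(6)]
gram = [[weighted_inner(p, q) for q in polys] for p in polys]
for row in gram:
    print([str(entry) for entry in row])  # off-diagonal entries are exactly 0
```

Because the arithmetic is exact, the off-diagonal entries of the Gram matrix are exactly zero rather than merely small.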
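Proposition 2.7. (i) The spectrum of the operator $L_k^{(i_0,\ldots,i_k)}$ is

\[ \mathrm{Spec}\big(L_k^{(i_0,\ldots,i_k)}\big) = \bigcup_{m \ge 0} \Big\{ \lambda_m^{(k)} = \frac{(m+k)(m+k+1)}{2} \Big\} =: \Lambda_k \]

and the eigenvectors of $L_k^{(i_0,\ldots,i_k)}$ corresponding to $\lambda_m^{(k)}$ are of the form

\[ X = \sum_{|\alpha| = m} d^{(k)}_{m,\alpha} X^{(k)}_{m,\alpha}, \]

i.e., the eigenspace corresponding to $\lambda_m^{(k)}$ is of dimension $\binom{k+m-1}{m}$;
(ii) The spectrum of the operator $L_k$ is the same.

Proof. For brevity, write $L := L_k^{(i_0,\ldots,i_k)}$, $w := w_k^{(i_0,\ldots,i_k)}$ and $H := H_k^{(i_0,\ldots,i_k)}$.

(i) Proposition 2.1 implies that $\Lambda_k \subseteq \mathrm{Spec}(L)$. Conversely, for
$\lambda \notin \Lambda_k$, we will prove that $\lambda$ is not an eigenvalue of $L$. In fact,
assume that $X \in H$ is such that $L X = -\lambda X$ in $H$.
Because $\big\{X^{(k)}_{m,\alpha}\big\}_{m \ge 0, |\alpha| = m}$ is an orthogonal basis of $H$ with respect to the
weights $w$ (Proposition 2.6), we can represent $X$ by $X = \sum_{m=0}^{\infty} \sum_{|\alpha| = m} d^{(k)}_{m,\alpha} X^{(k)}_{m,\alpha}$.
It follows that

\[ \sum_{m=0}^{\infty} \sum_{|\alpha| = m} d^{(k)}_{m,\alpha} \big(-\lambda_m^{(k)}\big) X^{(k)}_{m,\alpha} = \sum_{m=0}^{\infty} \sum_{|\alpha| = m} d^{(k)}_{m,\alpha}\, L X^{(k)}_{m,\alpha} = L X = -\lambda \sum_{m=0}^{\infty} \sum_{|\alpha| = m} d^{(k)}_{m,\alpha} X^{(k)}_{m,\alpha}. \]

For any $j \ge 0$, $|\beta| = j$, multiplying by $w X^{(k)}_{j,\beta}$ and then integrating on $V_k^{(i_0,\ldots,i_k)}$,
we have

\begin{align*}
& \sum_{|\alpha| = j} d^{(k)}_{j,\alpha} \lambda_j^{(k)} \big(X^{(k)}_{j,\alpha}, w X^{(k)}_{j,\beta}\big) = \sum_{|\alpha| = j} d^{(k)}_{j,\alpha} \lambda \big(X^{(k)}_{j,\alpha}, w X^{(k)}_{j,\beta}\big), \quad \forall j \ge 0,\ |\beta| = j \\
\Rightarrow\ & d^{(k)}_{j,\alpha} \lambda_j^{(k)} = d^{(k)}_{j,\alpha} \lambda, \quad \forall j \ge 0,\ |\alpha| = j, \ \text{ because } \det \big(X^{(k)}_{j,\alpha}, w X^{(k)}_{j,\beta}\big)_{\alpha,\beta} \ne 0 \\
\Rightarrow\ & d^{(k)}_{j,\alpha} = 0, \quad \forall j \ge 0,\ |\alpha| = j, \ \text{ because } \lambda \ne \lambda_j^{(k)}.
\end{align*}

It follows that $X = 0$ in $H$. Therefore

\[ \mathrm{Spec}(L) = \bigcup_{m \ge 0} \big\{ \lambda_m^{(k)} \big\}. \]

Moreover, assume that $X \in H$ is an eigenvector of $L$ corresponding
to $\lambda_j^{(k)}$, i.e., $L X = -\lambda_j^{(k)} X$. We represent $X$ by

\[ X = \sum_{m=0}^{\infty} \sum_{|\alpha| = m} d^{(k)}_{m,\alpha} X^{(k)}_{m,\alpha}. \]

It follows that

\[ \sum_{m=0}^{\infty} \sum_{|\alpha| = m} d^{(k)}_{m,\alpha} \big(-\lambda_m^{(k)}\big) X^{(k)}_{m,\alpha} = \sum_{m=0}^{\infty} \sum_{|\alpha| = m} d^{(k)}_{m,\alpha}\, L X^{(k)}_{m,\alpha} = -\lambda_j^{(k)} \sum_{m=0}^{\infty} \sum_{|\alpha| = m} d^{(k)}_{m,\alpha} X^{(k)}_{m,\alpha}. \]

For any $i \ne j$, $|\beta| = i$, multiplying by $w X^{(k)}_{i,\beta}$ and then integrating on $V_k^{(i_0,\ldots,i_k)}$,
we have

\begin{align*}
& \sum_{|\alpha| = i} d^{(k)}_{i,\alpha} \lambda_i^{(k)} \big(X^{(k)}_{i,\alpha}, w X^{(k)}_{i,\beta}\big) = \sum_{|\alpha| = i} d^{(k)}_{i,\alpha} \lambda_j^{(k)} \big(X^{(k)}_{i,\alpha}, w X^{(k)}_{i,\beta}\big), \quad \forall i \ne j,\ |\beta| = i \\
\Rightarrow\ & d^{(k)}_{i,\alpha} \lambda_i^{(k)} = d^{(k)}_{i,\alpha} \lambda_j^{(k)}, \quad \forall i \ne j,\ |\alpha| = i, \ \text{ because } \det \big(X^{(k)}_{i,\alpha}, w X^{(k)}_{i,\beta}\big)_{\alpha,\beta} \ne 0 \\
\Rightarrow\ & d^{(k)}_{i,\alpha} = 0, \quad \forall i \ne j,\ |\alpha| = i, \ \text{ because } \lambda_i^{(k)} \ne \lambda_j^{(k)}.
\end{align*}

It follows that

\[ X = \sum_{|\alpha| = j} d^{(k)}_{j,\alpha} X^{(k)}_{j,\alpha}. \]

This completes the proof of (i).
(ii) is obvious.

□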
2.2. Definition of the solution. We shall now derive the Fokker-Planck equation
as the diffusion limit of the Wright-Fisher model and our solution concept for this
equation. We consider a diploid population of fixed size $N$ with $n+1$ possible alleles
$A_1, \ldots, A_{n+1}$ at a given locus. Suppose that the individuals in the population
are monoecious, that there are no selective differences between these alleles, and
that there are no mutations. There are $2N$ alleles in the population in any generation, so it
is sufficient to focus on the numbers $Y_m = (Y_m^1, \ldots, Y_m^n)$ of alleles $A_1, \ldots, A_n$ at
generation time $m$. Assume that $Y_0 = i_0 = (i_0^1, \ldots, i_0^n)$ and that, according to the
Wright-Fisher model, the alleles in generation $m+1$ are derived by sampling with
replacement from the alleles of generation $m$. Thus, the transition probability is

\[ P(Y_{m+1} = j \mid Y_m = i) = \frac{(2N)!}{j^0!\, j^1! \cdots j^n!} \prod_{k=0}^{n} \Big( \frac{i^k}{2N} \Big)^{j^k}, \]

where

\[ i, j \in S^{(2N)} = \Big\{ z = (z^1, \ldots, z^n) : z^k \in \{0, 1, \ldots, 2N\},\ \sum_{k=1}^{n} z^k \le 2N \Big\} \]

and

\[ i^0 = 2N - |i| = 2N - i^1 - \cdots - i^n; \qquad j^0 = 2N - |j| = 2N - j^1 - \cdots - j^n. \]

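After rescaling

\[ t = \frac{m}{2N}, \qquad X_t = \frac{Y_m}{2N}, \]

we have a discrete Markov chain $X_t$ valued in $\{0, \frac{1}{2N}, \ldots, 1\}^n$ with $t = 1$ now
corresponding to $2N$ generations. Writing $\delta t = \frac{1}{2N}$ and $\delta X_t = X_{t+\delta t} - X_t$, it is easy to see that

\begin{align*}
X_0 &= p = \frac{i_0}{2N}, \\
E(\delta X_t^i) &= 0, \\
E(\delta X_t^i\, \delta X_t^j) &= X_t^i (\delta_{ij} - X_t^j)\, \delta t, \tag{2.3} \\
E(\delta X_t)^\alpha &= o(\delta t) \ \text{ for } |\alpha| \ge 3,
\end{align*}

where the expectations are conditional on $X_t$.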
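The first two conditions in (2.3) can be confirmed exactly for a small population by enumerating the multinomial transition probabilities. The following is a minimal sketch under illustrative assumptions (three alleles, $2N = 6$); the function names `transition_prob` and `increment_moments` are hypothetical, and exact rational arithmetic is used so that no sampling noise enters.

```python
from fractions import Fraction
from itertools import product
from math import factorial


def transition_prob(i, j, two_n):
    """Multinomial Wright-Fisher probability P(Y_{m+1} = j | Y_m = i).

    i and j list the counts of alleles 1..n; the count of allele 0 is implied.
    """
    i_full = (two_n - sum(i),) + tuple(i)
    j_full = (two_n - sum(j),) + tuple(j)
    prob = Fraction(factorial(two_n))
    for ik, jk in zip(i_full, j_full):
        prob *= Fraction(ik, two_n) ** jk / factorial(jk)
    return prob


def increment_moments(i, two_n):
    """Exact total probability, E(dX^a) and E(dX^a dX^b) for one generation."""
    n = len(i)
    x = [Fraction(c, two_n) for c in i]
    total = Fraction(0)
    mean = [Fraction(0)] * n
    second = [[Fraction(0)] * n for _ in range(n)]
    for j in product(range(two_n + 1), repeat=n):
        if sum(j) > two_n:
            continue
        p = transition_prob(i, j, two_n)
        total += p
        dx = [Fraction(jk, two_n) - xk for jk, xk in zip(j, x)]
        for a in range(n):
            mean[a] += p * dx[a]
            for b in range(n):
                second[a][b] += p * dx[a] * dx[b]
    return total, mean, second


two_n = 6
total, mean, second = increment_moments((2, 1), two_n)
x = [Fraction(2, two_n), Fraction(1, two_n)]
expected = [[x[a] * ((a == b) - x[b]) / two_n for b in range(2)] for a in range(2)]
print(total, mean)
print(second == expected)
```

For the multinomial scheme the covariance identity holds exactly for every $N$, not only asymptotically; only the higher-moment condition in (2.3) relies on $N$ being large.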

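We now denote by $m_\alpha(t)$ the $\alpha$-th moment of the distribution about zero at the $t$-th
generation, i.e.,

\[ m_\alpha(t) = E(X_t)^\alpha. \]

Then

\[ m_\alpha(t+1) = E(X_t + \delta X_t)^\alpha. \]

Expanding the right hand side and noting (2.3) we obtain the following recursion
formula, under the assumption that the population number $N$ is sufficiently large
to neglect terms of order $\frac{1}{N^2}$ and higher,

\[ m_\alpha(t+1) = \Big\{ 1 - \frac{|\alpha|(|\alpha| - 1)}{4N} \Big\} m_\alpha(t) + \sum_{i=1}^{n} \frac{\alpha^i(\alpha^i - 1)}{4N}\, m_{\alpha - e_i}(t). \tag{2.4} \]

Under this assumption, the moments change very slowly per generation and we can
replace this system of difference equations by a system of differential equations, which in the rescaled time (one unit of time corresponding to $2N$ generations) reads

\[ \dot m_\alpha(t) = -\frac{|\alpha|(|\alpha| - 1)}{2}\, m_\alpha(t) + \sum_{i=1}^{n} \frac{\alpha^i(\alpha^i - 1)}{2}\, m_{\alpha - e_i}(t). \tag{2.5} \]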
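For two alleles ($n = 1$) the moment system is triangular and can be integrated by hand: $\dot m_1 = 0$ and $\dot m_2 = -m_2 + m_1$, so $m_1(t) = p$ and $m_2(t) = p - p(1-p)e^{-t}$. The sketch below, an illustration only, integrates the system numerically with a classical Runge-Kutta scheme and compares with this closed form; the function names and the step count are arbitrary choices.

```python
from math import exp


def moment_rhs(m):
    """Right-hand side of the moment ODE for two alleles; m[a] is the a-th moment."""
    return [0.0 if a < 2 else -0.5 * a * (a - 1) * (m[a] - m[a - 1])
            for a in range(len(m))]


def integrate_moments(p, max_order, t_end, steps=1000):
    """Classical fourth-order Runge-Kutta integration starting from X_0 = p."""
    m = [p ** a for a in range(max_order + 1)]
    h = t_end / steps
    for _ in range(steps):
        k1 = moment_rhs(m)
        k2 = moment_rhs([x + 0.5 * h * k for x, k in zip(m, k1)])
        k3 = moment_rhs([x + 0.5 * h * k for x, k in zip(m, k2)])
        k4 = moment_rhs([x + h * k for x, k in zip(m, k3)])
        m = [x + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
             for x, a, b, c, d in zip(m, k1, k2, k3, k4)]
    return m


p, t = 0.3, 2.0
numeric = integrate_moments(p, 3, t)
closed_form = p - p * (1.0 - p) * exp(-t)
print(numeric[2], closed_form)
```

As $t \to \infty$ the second moment tends to $p$, which is consistent with eventual fixation: the limiting distribution is concentrated on $\{0, 1\}$ with $P(X = 1) = p$.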
With the aim of finding a continuous process which is a good approximation of the
above discrete process, we look for a continuous Markov process $\{X_t\}_{t \ge 0}$
valued in $[0,1]^n$ satisfying the same conditions (2.3) and (2.5). Specifically, if we call
$u(x,t)$ the probability density function of this continuous process, the condition
(2.3) implies (see for example [5], p. 137) that $u$ is a solution of the Fokker-Planck
(Kolmogorov forward) equation

\[ \begin{cases} u_t = L_n u & \text{in } V_n \times (0, \infty), \\ u(x, 0) = \delta_p(x) & \text{in } V_n, \end{cases} \tag{2.6} \]

and the condition (2.5) implies

\[ [u_t, x^\alpha]_n = [u, L_n^*(x^\alpha)]_n, \quad \forall \alpha, \]

i.e.,

\[ [u_t, \phi]_n = [u, L_n^*(\phi)]_n, \quad \forall \phi \in H_n. \tag{2.7} \]

This leads us to the following definition of solutions.

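Definition 2.8. We call $u \in H$ a solution of the Fokker-Planck equation associated
with the Wright-Fisher model if

\begin{align*}
u_t &= L_n u \ \text{ in } V_n \times (0, \infty), \tag{2.8} \\
u(x, 0) &= \delta_p(x) \ \text{ in } V_n, \tag{2.9} \\
[u_t, \phi]_n &= [u, L_n^* \phi]_n, \quad \forall \phi \in H_n. \tag{2.10}
\end{align*}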

2.3. The global solution. In this subsection, we construct the solution and
prove its existence as well as its uniqueness. The process of finding
the solution is as follows: We first find the general solution of the Fokker-Planck
equation (2.8) by the method of separation of variables. Then we construct a solution
depending on certain parameters. We then use the conditions (2.9), (2.10) to
determine the parameters. Finally, we check the solution.

Step 1: On $V_n$, assume that $u_n(x, t) = X(x) T(t)$ is a solution of the
Fokker-Planck equation (2.8). Then we have

\[ \frac{T_t}{T} = \frac{L_n X}{X} = -\lambda. \]

Clearly $\lambda$ is a constant which is independent of $T$, $X$. From Proposition 2.7
we obtain the local solution of equation (2.8) in the form

\[ u_n(x, t) = \sum_{m=0}^{\infty} \sum_{|\alpha| = m} c^{(n)}_{m,\alpha} X^{(n)}_{m,\alpha}(x)\, e^{-\lambda_m^{(n)} t}, \]

where

\[ \lambda_m^{(n)} = \frac{(n+m)(n+m+1)}{2} \]

is the eigenvalue of $L_n$ and

\[ X^{(n)}_{m,\alpha}(x), \quad |\alpha| = m, \]

are the corresponding eigenvectors of $L_n$.
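For $m \ge 0$, $|\beta| = m$, we conclude from Proposition 2.3 that

\[ L_n^*\big(w_n X^{(n)}_{m,\beta}\big) = -\lambda_m^{(n)}\, w_n X^{(n)}_{m,\beta}. \]

It follows that

\[ \big[u_t, w_n X^{(n)}_{m,\beta}\big]_n = \big[u, L_n^*\big(w_n X^{(n)}_{m,\beta}\big)\big]_n \ \text{(the moment condition)} \ = -\lambda_m^{(n)} \big[u, w_n X^{(n)}_{m,\beta}\big]_n. \]

Therefore

\[ \big[u, w_n X^{(n)}_{m,\beta}\big]_n = \big[u(\cdot, 0), w_n X^{(n)}_{m,\beta}\big]_n\, e^{-\lambda_m^{(n)} t} = w_n(p) X^{(n)}_{m,\beta}(p)\, e^{-\lambda_m^{(n)} t}. \]

Thus,

\begin{align*}
w_n(p) X^{(n)}_{m,\beta}(p)\, e^{-\lambda_m^{(n)} t} &= \big[u, w_n X^{(n)}_{m,\beta}\big]_n \\
&= \big(u_n, w_n X^{(n)}_{m,\beta}\big)_n \quad \text{(because } w_n \text{ vanishes on the boundary)} \\
&= \sum_{|\alpha| = m} c^{(n)}_{m,\alpha} \big(X^{(n)}_{m,\alpha}, w_n X^{(n)}_{m,\beta}\big)_n\, e^{-\lambda_m^{(n)} t},
\end{align*}

where the last step uses the orthogonality of Proposition 2.6. It follows that

\[ \sum_{|\alpha| = m} c^{(n)}_{m,\alpha} \big(X^{(n)}_{m,\alpha}, w_n X^{(n)}_{m,\beta}\big)_n = w_n(p) X^{(n)}_{m,\beta}(p), \quad \forall |\beta| = m, \]

a linear system that determines the coefficients $c^{(n)}_{m,\alpha}$, $|\alpha| = m$.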
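In the two-allele case $n = 1$ each eigenspace is one-dimensional, so the linear system for the coefficients reduces to $c_m = w_1(p) X_m(p) / (X_m, w_1 X_m)_1$. The sketch below, an illustration under that restriction and with Lebesgue measure on $[0,1]$, evaluates the truncated local solution and integrates it over the open interval. The result is strictly less than 1, which reflects the probability mass that has already been absorbed at the boundary. The function names and the truncation at 20 terms are assumptions of the sketch.

```python
from fractions import Fraction
from math import exp


def eigenpoly_1d(m):
    """Coefficient list [a_0, ..., a_m] of X_m for k = 1 (recursion of Proposition 2.1)."""
    a = [Fraction(0)] * (m + 1)
    a[m] = Fraction(1)
    for b in range(m - 1, -1, -1):
        a[b] = -Fraction((b + 2) * (b + 1)) * a[b + 1] / ((m - b) * (m + b + 3))
    return a


def evaluate(poly, x):
    """Value of the polynomial at x."""
    return sum(c * x ** i for i, c in enumerate(poly))


def integral(poly):
    """Exact integral of the polynomial over [0, 1]."""
    return sum(c / (i + 1) for i, c in enumerate(poly))


def weighted_norm(poly):
    """Exact integral over [0, 1] of poly(x)^2 * x * (1 - x)."""
    return sum(ci * cj * (Fraction(1, i + j + 2) - Fraction(1, i + j + 3))
               for i, ci in enumerate(poly) for j, cj in enumerate(poly))


def interior_mass(p, t, terms=20):
    """Integral over (0, 1) of the truncated series sum_m c_m X_m(x) exp(-lambda_m t)."""
    total = 0.0
    for m in range(terms):
        X = eigenpoly_1d(m)
        c_m = p * (1 - p) * evaluate(X, p) / weighted_norm(X)
        lam = (m + 1) * (m + 2) / 2
        total += float(c_m * integral(X)) * exp(-lam * t)
    return total


p = Fraction(1, 2)
for t in (0.5, 1.0, 2.0):
    print(t, interior_mass(p, t))
```

For $p = \tfrac12$ the leading term is $\tfrac32 e^{-t}$, so the interior mass decays at the rate $\lambda_0^{(1)} = 1$. The truncation is adequate only for $t$ bounded away from zero, since the series converges slowly as $t \to 0$.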



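Step 2: The solution $u \in H$ satisfying (2.8) will be sought in the following form:

\[ u(x, t) = \sum_{k=1}^{n} u_k(x, t)\, \chi_{V_k}(x) + \sum_{i=0}^{n} u_0^i(x, t)\, \delta_{e_i}(x). \tag{2.11} \]

We use the condition (2.10) to obtain successively the values of $u_k$, $k = n-1, \ldots, 0$.
In fact, assume that we want to calculate $u_{n-1}^{(0,1,\ldots,n-1)}(x^1, \ldots, x^{n-1}, 0, t)$.
We note that, if we choose

\[ \phi(x) = w_{n-1}^{(0,1,\ldots,n-1)}(x)\, X^{(n-1)}_{j,\beta}(x^1, \ldots, x^{n-1}), \]

then $\phi(x)$ vanishes on the faces of dimension at most $n-1$ except the face $V_{n-1}^{(0,1,\ldots,n-1)}$.
Therefore, the expectation of $\phi$ will be

\[ [u, \phi]_n = (u_n, \phi)_n + \big(u_{n-1}^{(0,1,\ldots,n-1)}, \phi\big)_{n-1}. \]

The left hand side can be calculated easily by the condition (2.10):

\[ [u_t, \phi]_n = [u, L_n^*(\phi)]_n = -\lambda_j^{(n-1)} [u, \phi]_n. \tag{2.12} \]

It follows that

\[ [u, \phi]_n = \phi(p)\, e^{-\lambda_j^{(n-1)} t}. \]

The first part of the right hand side is known as

\[ (u_n, \phi)_n = \sum_{m \ge 0} \sum_{|\alpha| = m} c^{(n)}_{m,\alpha} \Big( \int_{V_n} X^{(n)}_{m,\alpha}(x) \phi(x)\, dx \Big) e^{-\lambda_m^{(n)} t}. \]

Therefore we can expand $u_{n-1}^{(0,1,\ldots,n-1)}(x^1, \ldots, x^{n-1}, 0, t)$ as follows:

\[ u_{n-1}^{(0,1,\ldots,n-1)}(x, t) = \sum_{m \ge 0} c_m(x)\, e^{-\lambda_m^{(n-1)} t} = \sum_{m \ge 0} \sum_{l \ge 0} \sum_{|\alpha| = l} c^{(n-1)}_{m,l,\alpha} X^{(n-1)}_{l,\alpha}(x)\, e^{-\lambda_m^{(n-1)} t}. \]

Putting this formula into Equation (2.12), we obtain all the coefficients $c^{(n-1)}_{m,l,\alpha}$.
This means that we obtain $u_{n-1}^{(0,1,\ldots,n-1)}(x^1, \ldots, x^{n-1}, 0, t)$. Similarly we obtain
$u_{n-1}$, and finally we obtain all $u_k$, $k = n-1, \ldots, 0$. This means that we obtain the
global solution in the form

\[ u(x, t) = \sum_{k=1}^{n} \sum_{m \ge 0} \sum_{l \ge 0} \sum_{|\alpha| = l} c^{(k)}_{m,l,\alpha} X^{(k)}_{l,\alpha}(x)\, e^{-\lambda_m^{(k)} t}\, \chi_{V_k}(x) + \sum_{i=0}^{n} u_0^i(x, t)\, \delta_{e_i}(x). \tag{2.13} \]

It is not difficult to show that $u$ is a solution of the Fokker-Planck equation
associated with the Wright-Fisher model.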

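Step 3: We can easily see that this solution is unique. In fact, assume that $u_1$, $u_2$
are two solutions of the Fokker-Planck equation associated with the Wright-Fisher model. Then
$u = u_1 - u_2$ will satisfy

\begin{align*}
u_t &= L_n u \ \text{ in } V_n \times (0, \infty), \\
u(x, 0) &= 0 \ \text{ in } V_n, \\
[u_t, \phi]_n &= [u, L_n^* \phi]_n, \quad \forall \phi \in H_n.
\end{align*}

It follows that

\begin{align*}
[u_t, 1]_n &= [u, L_n^*(1)]_n = 0, \\
[u_t, x^i]_n &= [u, L_n^*(x^i)]_n = 0, \\
\big[u_t, w_k^{(i_0,\ldots,i_k)} X^{(k)}_{j,\alpha}\big]_n &= \big[u, L_n^*\big(w_k^{(i_0,\ldots,i_k)} X^{(k)}_{j,\alpha}\big)\big]_n = -\lambda_j^{(k)} \big[u, w_k^{(i_0,\ldots,i_k)} X^{(k)}_{j,\alpha}\big]_n.
\end{align*}

Therefore

\begin{align*}
[u, 1]_n &= [u(\cdot, 0), 1]_n = 0, \\
[u, x^i]_n &= [u(\cdot, 0), x^i]_n = 0, \\
\big[u, w_k^{(i_0,\ldots,i_k)} X^{(k)}_{j,\alpha}\big]_n &= \big[u(\cdot, 0), w_k^{(i_0,\ldots,i_k)} X^{(k)}_{j,\alpha}\big]_n\, e^{-\lambda_j^{(k)} t} = 0.
\end{align*}

Since $\Big\{ 1, \{x^i\}_i, \big\{ w_k^{(i_0,\ldots,i_k)} X^{(k)}_{j,\alpha}\, \chi_{V_k^{(i_0,\ldots,i_k)}} \big\}_{1 \le k \le n,\ (i_0,\ldots,i_k) \in I_k,\ j \ge 0,\ |\alpha| = j} \Big\}$ is also a basis
of $H_n$, it follows that $u = 0 \in H$.
In conclusion, we have established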
Theorem 2.9. The Fokker-Planck equation associated with the Wright-Fisher
model with $(n+1)$ alleles possesses the unique solution

\[ u(x, t) = \sum_{k=1}^{n} \sum_{m \ge 0} \sum_{l \ge 0} \sum_{|\alpha| = l} c^{(k)}_{m,l,\alpha} X^{(k)}_{l,\alpha}(x)\, e^{-\lambda_m^{(k)} t}\, \chi_{V_k}(x) + \sum_{i=0}^{n} u_0^i(x, t)\, \delta_{e_i}(x). \tag{2.14} \]

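Example 2.10. To illustrate this process, we consider the case of three alleles.
We will construct the global solution for the problem

\begin{align*}
u_t &= L_2 u, \ \text{ in } V_2 \times (0, \infty), \\
u(x, 0) &= \delta_p(x), \quad x \in V_2, \\
[u_t, \phi]_2 &= [u, L_2^* \phi]_2, \ \text{ for all } \phi \in H_2,
\end{align*}

where the global solution is of the form

\[ u = u_2 \chi_{V_2} + u_1^{0,1} \chi_{V_1^{0,1}} + u_1^{0,2} \chi_{V_1^{0,2}} + u_1^{1,2} \chi_{V_1^{1,2}} + u_0^0 \chi_{V_0^0} + u_0^1 \chi_{V_0^1} + u_0^2 \chi_{V_0^2} \]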

and the product is 



U2(l)\v2dx+ u°{\x^,0,t)<j){x^,0)dx^ + u°-'^{0,x^,t)(j){0,x^)dx^ 



V2 
V 2 Jo 

+ ^1(1, 0, 0) + uliO, 1, t)0(0, 1) + ?/[!(0, 0, 00(0, 0). 
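Step 1: We find the local solution $u_2$ as follows:

\[ u_2(x, t) = \sum_{m \ge 0} \sum_{\alpha^1 + \alpha^2 = m} c^{(2)}_{m,\alpha^1,\alpha^2} X^{(2)}_{m,\alpha^1,\alpha^2}(x)\, e^{-\lambda_m^{(2)} t}. \]

To determine the coefficients $c^{(2)}_{m,\alpha^1,\alpha^2}$ we use the initial condition and the orthogonality of
the eigenvectors:

\begin{align*}
w_2(p) X^{(2)}_{m,\beta^1,\beta^2}(p) &= \big(u_2(0), w_2 X^{(2)}_{m,\beta^1,\beta^2}\big)_2 \quad \text{because } w_2 \text{ vanishes on the boundary} \\
&= \sum_{\alpha^1 + \alpha^2 = m} c^{(2)}_{m,\alpha^1,\alpha^2} \big(X^{(2)}_{m,\alpha^1,\alpha^2}, w_2 X^{(2)}_{m,\beta^1,\beta^2}\big)_2 \quad \text{for all } \beta^1 + \beta^2 = m.
\end{align*}

Because the matrix

\[ \Big( \big(X^{(2)}_{m,\alpha^1,\alpha^2}, w_2 X^{(2)}_{m,\beta^1,\beta^2}\big)_2 \Big)_{\alpha, \beta} \]

is positive definite, we obtain unique values of $c^{(2)}_{m,\alpha^1,\alpha^2}$. It follows that we have
a unique local solution $u_2$.

Step 2: We will use the moment condition to determine all other coefficients of the
global solution.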
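First, we determine the coefficients of $u_1^{1,2}$ as follows:

\begin{align*}
u_1^{1,2}(x^1, 1 - x^1, t) &= \sum_{m \ge 0} c_m(x^1)\, e^{-\lambda_m^{(1)} t} \tag{2.15} \\
&= \sum_{m, l \ge 0} c_{m,l}\, X_l^{(1)}(x^1)\, e^{-\lambda_m^{(1)} t}. \tag{2.16}
\end{align*}

We note that

\[ L_2^*\big(x^1 x^2 X_l^{(1)}(x^1)\big) = -\lambda_l^{(1)}\, x^1 x^2 X_l^{(1)}(x^1). \]

Therefore

\[ \big[u_t, x^1 x^2 X_l^{(1)}(x^1)\big]_2 = \big[u, L_2^*\big(x^1 x^2 X_l^{(1)}(x^1)\big)\big]_2 = -\lambda_l^{(1)} \big[u, x^1 x^2 X_l^{(1)}(x^1)\big]_2. \]

It follows that

\[ \big[u, x^1 x^2 X_l^{(1)}(x^1)\big]_2 = p^1 p^2 X_l^{(1)}(p^1)\, e^{-\lambda_l^{(1)} t}. \]

Thus we have

\begin{align*}
p^1 p^2 X_l^{(1)}(p^1)\, e^{-\lambda_l^{(1)} t} &= \big[u, x^1 x^2 X_l^{(1)}(x^1)\big]_2 \\
&= \big(u_2, x^1 x^2 X_l^{(1)}(x^1)\big)_2 + \big(u_1^{1,2}(x^1, 1 - x^1), x^1 x^2 X_l^{(1)}(x^1)\big)_1 \\
&\qquad \text{because } x^1 x^2 \text{ vanishes on the other boundaries} \\
&= \sum_{m \ge 0} \sum_{\alpha^1 + \alpha^2 = m} c^{(2)}_{m,\alpha^1,\alpha^2} \big(X^{(2)}_{m,\alpha^1,\alpha^2}, x^1 x^2 X_l^{(1)}\big)_2\, e^{-\lambda_m^{(2)} t} + \sum_{m \ge 0} c_{m,l} \big(X_l^{(1)}, w_1 X_l^{(1)}\big)_1\, e^{-\lambda_m^{(1)} t} \\
&\qquad \text{because of the orthogonality of } (\cdot, \cdot)_1 \text{ with respect to } w_1.
\end{align*}

By equating the coefficients of $e^{-\lambda_m^{(1)} t}$ (note that $\lambda_m^{(2)} = \lambda_{m+1}^{(1)}$) we obtain $u_1^{1,2}$. Similarly we obtain $u_1$. Then
we determine the coefficients of $u_0$ from the first moments.

Note that when $\phi = x^1$, $L_2^*(\phi) = 0$; therefore $[u_t, \phi]_2 = 0$, or

\[ [u, x^1]_2 = [u(0), x^1]_2 = p^1. \]

It follows that

\[ p^1 = [u, x^1]_2 = (u_2, x^1)_2 + \big(u_1^{0,1}, x^1\big)_1 + \big(u_1^{1,2}, x^1\big)_1 + u_0^1(1, 0, t). \]

Thus we obtain $u_0^1(1, 0, t)$. Similarly we obtain all of $u_0$. Therefore we obtain the global
solution $u$.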

It is easy to check that $u$ is a global solution. To prove the uniqueness we proceed
as follows. Assume that $u$ is the difference of any two global solutions, i.e. $u$ satisfies

\begin{align*}
u_t &= L_2 u, \ \text{ in } V_2 \times (0, \infty), \\
u(x, 0) &= 0, \ \text{ in } V_2, \\
[u_t, \phi]_2 &= [u, L_2^* \phi]_2, \ \text{ for all } \phi \in H_2.
\end{align*}

We will prove that

\[ [u, \phi]_2 = 0 \quad \forall \phi \in H_2. \tag{2.17} \]






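In fact,

\begin{align*}
[u_t, 1]_2 = [u, L_2^*(1)]_2 = 0 \ &\Rightarrow\ [u, 1]_2 = [u(0), 1]_2 = 0, \\
[u_t, x^i]_2 = [u, L_2^*(x^i)]_2 = 0 \ &\Rightarrow\ [u, x^i]_2 = [u(0), x^i]_2 = 0, \\
\big[u_t, w_1(x^i) X_m^{(1)}(x^i)\big]_2 = \big[u, L_2^*\big(w_1(x^i) X_m^{(1)}(x^i)\big)\big]_2 &= -\lambda_m^{(1)} \big[u, w_1(x^i) X_m^{(1)}(x^i)\big]_2 \\
&\Rightarrow\ \big[u, w_1(x^i) X_m^{(1)}(x^i)\big]_2 = \big[u(0), w_1(x^i) X_m^{(1)}(x^i)\big]_2\, e^{-\lambda_m^{(1)} t} = 0, \\
\big[u_t, w_2(x^1, x^2) X^{(2)}_{m,\alpha}(x^1, x^2)\big]_2 = \big[u, L_2^*\big(w_2(x^1, x^2) X^{(2)}_{m,\alpha}(x^1, x^2)\big)\big]_2 &= -\lambda_m^{(2)} \big[u, w_2(x^1, x^2) X^{(2)}_{m,\alpha}(x^1, x^2)\big]_2 \\
&\Rightarrow\ \big[u, w_2(x^1, x^2) X^{(2)}_{m,\alpha}(x^1, x^2)\big]_2 = \big[u(0), w_2(x^1, x^2) X^{(2)}_{m,\alpha}(x^1, x^2)\big]_2\, e^{-\lambda_m^{(2)} t} = 0.
\end{align*}

We need only prove that Eq. (2.17) holds for all

\[ \phi(x) = (x^1)^m (x^2)^n, \quad \forall m, n \ge 0. \]

(1) If $n = 0$, $m \ge 0$, we see that $\phi$ can be generated from $\{1, x^1, w_1(x^1) X_m^{(1)}(x^1)\}$;
therefore $[u, \phi]_2 = 0$.

(2) If $m = 0$, $n \ge 0$, we see that $\phi$ can be generated from $\{1, x^2, w_1(x^2) X_n^{(1)}(x^2)\}$;
therefore $[u, \phi]_2 = 0$.

(3) If $n = 1$, $m \ge 1$, we expand $(x^1)^{m-1}$ by

\[ (x^1)^{m-1} = \sum_{k \ge 0} c_k X_k^{(1)}(x^1). \]

Note that

\[ L_2^*\big(x^1 x^2 X_k^{(1)}(x^1)\big) = -\lambda_k^{(1)}\, x^1 x^2 X_k^{(1)}(x^1). \]

Therefore

\[ \big[u_t, x^1 x^2 X_k^{(1)}(x^1)\big]_2 = \big[u, L_2^*\big(x^1 x^2 X_k^{(1)}(x^1)\big)\big]_2 = -\lambda_k^{(1)} \big[u, x^1 x^2 X_k^{(1)}(x^1)\big]_2. \]

It follows that

\[ \big[u, x^1 x^2 X_k^{(1)}(x^1)\big]_2 = \big[u(0), x^1 x^2 X_k^{(1)}(x^1)\big]_2\, e^{-\lambda_k^{(1)} t} = 0. \]

Therefore

\[ \big[u, (x^1)^m x^2\big]_2 = \sum_{k \ge 0} c_k \big[u, x^1 x^2 X_k^{(1)}(x^1)\big]_2 = 0. \]

(4) If $n \ge 2$, $m \ge 1$, we use induction on $n$. We have

\begin{align*}
(x^1)^m (x^2)^n &= x^1 x^2 (x^1 + x^2 - 1)(x^1)^{m-1} (x^2)^{n-2} + (x^1)^m (1 - x^1)(x^2)^{n-1} \\
&= -w_2(x^1, x^2)(x^1)^{m-1} (x^2)^{n-2} + (x^1)^m (1 - x^1)(x^2)^{n-1}.
\end{align*}

By the induction hypothesis, we have

\[ \big[u, (x^1)^m (1 - x^1)(x^2)^{n-1}\big]_2 = 0. \]

Then, we expand $(x^1)^{m-1} (x^2)^{n-2}$ as

\[ (x^1)^{m-1} (x^2)^{n-2} = \sum_{k \ge 0} \sum_{|\alpha| = k} c_{k,\alpha} X^{(2)}_{k,\alpha}(x^1, x^2). \]

Therefore

\[ \big[u, w_2(x^1, x^2)(x^1)^{m-1} (x^2)^{n-2}\big]_2 = \sum_{k \ge 0} \sum_{|\alpha| = k} c_{k,\alpha} \big[u, w_2(x^1, x^2) X^{(2)}_{k,\alpha}(x^1, x^2)\big]_2 = 0. \]

It follows that $[u, (x^1)^m (x^2)^n]_2 = 0$.
Thus, $u = 0$.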
3. Applications

In this section, we present some applications of our global solution to the
evolution of the process $(X_t)_{t \ge 0}$, such as the expectation and the second moment of
the absorption time, the probability distribution of the absorption time for having
$k+1$ alleles, the probability of having exactly $k+1$ alleles, the $\alpha$-th moments, the
probability of heterogeneity, and the rate of loss of one allele in a population having
$k+1$ alleles. Some of these results are already known; for those we refer to
[2], [1], [5], [6], [7], [8], etc.



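3.1. The absorption time for having $(k+1)$ alleles. We denote by $T_{n+1}^{k+1}(p) =
\inf\{t > 0 : X_t \in \bar V_k \mid X_0 = p\}$ the first time when the population has (at most) $k+1$
alleles. $T_{n+1}^{k+1}(p)$ is a continuous random variable valued in $[0, \infty)$ and we denote by
$\phi(t, p)$ its probability density function. It is easy to see that $\bar V_k$ is invariant under
the process $(X_t)_{t \ge 0}$, i.e. if $X_s \in \bar V_k$ then $X_t \in \bar V_k$ for all $t \ge s$ (once an allele is
lost from the population, it can never again be recovered). We have the equality

\[ P\big(T_{n+1}^{k+1}(p) \le t\big) = P(X_t \in \bar V_k \mid X_0 = p) = \int_{\bar V_k} u(x, p, t)\, d\mu(x). \]

It follows that

\[ \phi(t, p) = \int_{\bar V_k} \frac{\partial}{\partial t} u(x, p, t)\, d\mu(x). \]

Therefore the expectation for the absorption time of having $k+1$ alleles is (see also
0, p. 194)

\[ E\big(T_{n+1}^{k+1}(p)\big) = \int_0^\infty t\, \phi(t, p)\, dt = \int_{\bar V_k} \int_0^\infty t\, \frac{\partial}{\partial t} u(x, p, t)\, dt\, d\mu(x) \]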

j=l(ia....,ijKIjm>0\a\=m •'^3 " ^ 

1=0 m>0 |a|=jn ^ ° ^ 

= E E E E •ai^ I.,. (-tJo ) "mS* 

3 = 1 {io,...,ij)eIj m>0\a\=7n ^3 ^ '^m/ 

i=0 k=l m>0 \a\=m ^ '^"i ^ 
and the second moment of this absorption time is (see also [8])

\[ E\big(T_{n+1}^{k+1}(p)\big)^2 = \int_0^\infty t^2\, \phi(t, p)\, dt = \int_{\bar V_k} \int_0^\infty t^2\, \frac{\partial}{\partial t} u(x, p, t)\, dt\, d\mu(x) \]



i—Q k—1 m>0 \a\—m 

k 



EE EE eX.„...,,,^^?.w 



J^l {io,...,ij)eIj m>0 |a| 

71. Tl 



E E E E ^rn,a'^ln]a,i 
1=0 fc=l m>0 |Q|=m 



3.2. The probability distribution of the absorption time for having $k+1$
alleles. We note that $X_{T_{n+1}^{k+1}(p)}$ is a random variable valued in $\bar V_k$. We consider
the probability that this random variable takes its value in $V_k^{(i_0,\ldots,i_k)}$, i.e., the
probability that the population, at the first time it has at most $k+1$ alleles, consists
precisely of the $k+1$ alleles $\{A_{i_0}, \ldots, A_{i_k}\}$. Let $g_k$ be a function of $k$ variables
defined inductively by

92{p\p') =-^gi{p') + -^giip'y, 
1 — 1 — 

k+1 



^■gkip\...,p'-\f+\...,p'+') 



Then we shall have 
Theorem 3.1. 



Proof. Method 1: By proving that 



and elementary combinatorial arguments, we have immediately the result (see also 

H) 

Method 2: By proving that it is the unique solution of the classical Dirichlet 
problem 

C(^(.o,...,..))*^(p) ^Oin^fe 
limt;(p) =l,qeV^'°-'^\ 

limv{p) = 0,9 e dVk\V^''"-"'\Vk-i. 

p->-g 
□



3.3. The probability of having exactly $k+1$ alleles. The probability of having
only one allele $A_i$ (i.e., allele $A_i$ is fixed) is



n 

m('^)>0/(fc)>0 |af'^)|^/(^) 



The probability of having exactly $(k+1)$ alleles $\{A_0, \ldots, A_k\}$ (the coexistence
probability of the alleles $\{A_0, \ldots, A_k\}$) is (see also [H1[S])



( 



/(•O 



EEE 

m>0 i>0 \a\=l 



(•0, ■■■.•)=) 



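3.4. The $\alpha$-th moments. The $\alpha$-th moments are (see also [4], [5], [6])

\begin{align*}
m_\alpha(t) &= [u, x^\alpha]_n \\
&= \int_{\bar V_n} x^\alpha u(x, t)\, d\mu(x) \\
&= \sum_{k=0}^{n} \sum_{(i_0,\ldots,i_k) \in I_k} \int_{V_k^{(i_0,\ldots,i_k)}} x^\alpha u_k^{(i_0,\ldots,i_k)}(x, t)\, d\mu_k^{(i_0,\ldots,i_k)}(x).
\end{align*}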

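3.5. The probability of heterogeneity. The probability of heterogeneity is (see
also [5])

\begin{align*}
H_t &= (n+1)!\, [u, w_n]_n \\
&= (n+1)!\, (u_n, w_n)_n \quad \text{(because } w_n \text{ vanishes on the boundary)} \\
&= (n+1)!\, \Big( \sum_{m \ge 0} \sum_{|\alpha| = m} c^{(n)}_{m,\alpha} X^{(n)}_{m,\alpha}\, e^{-\lambda_m^{(n)} t},\ w_n X^{(n)}_{0,0} \Big)_n \\
&= (n+1)!\, \big( c^{(n)}_{0,0} X^{(n)}_{0,0},\ w_n X^{(n)}_{0,0} \big)_n\, e^{-\lambda_0^{(n)} t} \quad \text{(because of the orthogonality of the eigenvectors } X^{(n)}_{m,\alpha}\text{)} \\
&= H_0\, e^{-\lambda_0^{(n)} t}, \qquad \lambda_0^{(n)} = \frac{n(n+1)}{2}.
\end{align*}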
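For two alleles ($n = 1$) the probability of heterogeneity is $H_t = 2 E\big(X_t(1 - X_t)\big)$ and the decay rate is $\lambda_0^{(1)} = 1$ in units of $2N$ generations. The sketch below, an illustration restricted to this case, checks the discrete counterpart: under binomial sampling the expected heterozygosity shrinks by exactly the factor $1 - \frac{1}{2N}$ per generation, and $(1 - \frac{1}{2N})^{2Nt}$ approaches $e^{-t}$ for large $N$. The function name `next_heterozygosity` and the population sizes are arbitrary choices.

```python
from fractions import Fraction
from math import comb, exp


def next_heterozygosity(i, two_n):
    """Exact E[2 X'(1 - X')] after one generation, starting from frequency i / 2N."""
    x = Fraction(i, two_n)
    total = Fraction(0)
    for j in range(two_n + 1):
        prob = comb(two_n, j) * x ** j * (1 - x) ** (two_n - j)
        y = Fraction(j, two_n)
        total += prob * 2 * y * (1 - y)
    return total


two_n = 10
x = Fraction(3, two_n)
ratio = next_heterozygosity(3, two_n) / (2 * x * (1 - x))
print(ratio)  # per-generation factor 1 - 1/(2N)

# After 2N generations (t = 1) the discrete factor is close to exp(-1).
big = 1000
print((1 - 1 / big) ** big, exp(-1.0))
```

The agreement improves as $N$ grows, in line with the diffusion approximation being valid for large populations.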

3.6. The rate of loss of one allele in a population having $k+1$ alleles. We
have the solution in the form

\[ u(x, t) = \sum_{k=0}^{n} u_k(x, t)\, \chi_{V_k}(x). \]

The rate of loss of one allele in a population with $(k+1)$ alleles equals the rate of
decrease of

\[ u_k(x, t) = \sum_{m \ge 0} \sum_{l \ge 0} \sum_{|\alpha| = l} c^{(k)}_{m,l,\alpha} X^{(k)}_{l,\alpha}(x)\, e^{-\lambda_m^{(k)} t}, \]

which is $\lambda_0^{(k)} = \frac{k(k+1)}{2}$. This means that the rate of loss of alleles in the population
decreases as the number of remaining alleles decreases (see also [S]).

Conclusion 

We have developed a new global solution concept for the Fokker-Planck equation
associated with the Wright-Fisher model, and we have proved the existence and
uniqueness of this solution (Theorem 2.9). From this solution, we can easily read
off the properties of the considered process, like the absorption time of having
$k+1$ alleles, the probability of having exactly $k+1$ alleles, the $\alpha$-th moments, the
probability of heterogeneity, and the rate of loss of one allele in a population having
$k+1$ alleles.